Search | WHO COVID-19 Research Database

1.

Web Resources for SARS-CoV-2 Genomic Database, Annotation, Analysis and Variant Tracking.

Cheng, Yexiao; Ji, Chengyang; Zhou, Hang-Yu; Zheng, Heng; Wu, Aiping.

Viruses ; 15(5)2023 05 12.

Article in English | MEDLINE | ID: covidwho-20234105

ABSTRACT

The SARS-CoV-2 genomic data continue to grow, providing valuable information for researchers and public health officials. Genomic analysis of these data sheds light on the transmission and evolution of the virus. To aid in SARS-CoV-2 genomic analysis, many web resources have been developed to store, collate, analyze, and visualize the genomic data. This review summarizes web resources used for the SARS-CoV-2 genomic epidemiology, covering data management and sharing, genomic annotation, analysis, and variant tracking. The challenges and further expectations for these web resources are also discussed. Finally, we highlight the importance and need for continued development and improvement of related web resources to effectively track the spread and understand the evolution of the virus.

Subject(s)

COVID-19 , Humans , COVID-19/epidemiology , SARS-CoV-2/genetics , Genomics , Public Health , Research Personnel

2.

Genomic evidence for divergent co-infections of co-circulating SARS-CoV-2 lineages.

Zhou, Hang-Yu; Cheng, Ye-Xiao; Xu, Lin; Li, Jia-Ying; Tao, Chen-Yue; Ji, Cheng-Yang; Han, Na; Yang, Rong; Wu, Hui; Li, Yaling; Wu, Aiping.

Comput Struct Biotechnol J ; 20: 4015-4024, 2022.

Article in English | MEDLINE | ID: covidwho-2288930

ABSTRACT

Co-infection of RNA viruses may contribute to their recombination and cause severe clinical symptoms. However, the tracking and identification of SARS-CoV-2 co-infection persist as challenges. Due to the lack of methods for detecting co-infected samples in a large amount of deep sequencing data, the lineage composition, spatial-temporal distribution, and frequency of SARS-CoV-2 co-infection events in the population remains unclear. Here, we propose a hypergeometric distribution-based method named Cov2Coinfect with the ability to decode the lineage composition from 50,809 deep sequencing data. By resolving the mutational patterns in each sample, Cov2Coinfect can precisely determine the co-infected SARS-CoV-2 variants from deep sequencing data. Results from two independent and parallel projects in the United States achieved a similar co-infection rate of 0.3-0.5 % in SARS-CoV-2 positive samples. Notably, all co-infected variants were highly consistent with the co-circulating SARS-CoV-2 lineages in the regional epidemiology, demonstrating that the co-circulation of different variants is an essential prerequisite for co-infection. Overall, our study not only provides a robust method to identify the co-infected SARS-CoV-2 variants from sequencing samples, but also highlights the urgent need to pay more attention to co-infected patients for better disease prevention and control.

3.

covSampler: A subsampling method with balanced genetic diversity for large-scale SARS-CoV-2 genome data sets.

Cheng, Yexiao; Ji, Chengyang; Han, Na; Li, Jiaying; Xu, Lin; Chen, Ziyi; Yang, Rong; Zhou, Hang-Yu; Wu, Aiping.

Virus Evol ; 8(2): veac071, 2022.

Article in English | MEDLINE | ID: covidwho-2107592

ABSTRACT

Phylogenetic analysis has been widely used to describe, display, and infer the evolutionary patterns of viruses. The unprecedented accumulation of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genomes has provided valuable materials for the real-time study of SARS-CoV-2 evolution. However, the large number of SARS-CoV-2 genome sequences also poses great challenges for data analysis. Several methods for subsampling these large data sets have been introduced. However, current methods mainly focus on the spatiotemporal distribution of genomes without considering their genetic diversity, which might lead to post-subsampling bias. In this study, a subsampling method named covSampler was developed for the subsampling of SARS-CoV-2 genomes with consideration of both their spatiotemporal distribution and their genetic diversity. First, covSampler clusters all genomes according to their spatiotemporal distribution and genetic variation into groups that we call divergent pathways. Then, based on these divergent pathways, two kinds of subsampling strategies, representative subsampling and comprehensive subsampling, were provided with adjustable parameters to meet different users' requirements. Our performance and validation tests indicate that covSampler is efficient and stable, with an abundance of options for user customization. Overall, our work has developed an easy-to-use tool and a webserver (https://www.covsampler.net) for the subsampling of SARS-CoV-2 genome sequences.

4.

covSampler: A Subsampling Method with Balanced Genetic Diversity for Large-Scale SARS-CoV-2 Genome Data Sets

Cheng, Yexiao, Ji, Chengyang, Han, Na, Li, Jiaying, Xu, Lin, Chen, Ziyi, Yang, Rong, Zhou, Hang-Yu, Wu, Aiping.

Virus evolution ; 2022.

Article in English | EuropePMC | ID: covidwho-1998565

ABSTRACT

Phylogenetic analysis has been widely used to describe, display and infer the evolutionary patterns of viruses. The unprecedented accumulation of SARS-CoV-2 genomes has provided valuable materials for the real-time study of SARS-CoV-2 evolution. However, the large number of SARS-CoV-2 genome sequences also poses great challenges for data analysis. Several methods for subsampling these large data sets have been introduced. However, current methods mainly focus on the spatiotemporal distribution of genomes without considering their genetic diversity, which might lead to postsubsampling bias. In this study, a subsampling method named covSampler was developed for the subsampling of SARS-CoV-2 genomes with consideration of both their spatiotemporal distribution and their genetic diversity. First, covSampler clusters all genomes according to their spatiotemporal distribution and genetic variation into groups that we call divergent pathways. Then, based on these divergent pathways, two kinds of subsampling strategies, representative subsampling and comprehensive subsampling, were provided with adjustable parameters to meet different users’ requirements. Our performance and validation tests indicate that covSampler is efficient and stable, with an abundance of options for user customization. Overall, our work has developed an easy-to-use tool and a webserver (https://www.covsampler.net) for the subsampling of SARS-CoV-2 genome sequences.

5.

Detecting Potentially Adaptive Mutations from the Parallel and Fixed Patterns in SARS-CoV-2 Evolution.

Ji, Cheng-Yang; Han, Na; Cheng, Ye-Xiao; Shang, Jingzhe; Weng, Shenghui; Yang, Rong; Zhou, Hang-Yu; Wu, Aiping.

Viruses ; 14(5)2022 05 18.

Article in English | MEDLINE | ID: covidwho-1903490

ABSTRACT

Early identification of adaptive mutations could provide timely help for the control and prevention of the COVID-19 pandemic. The fast accumulation of SARS-CoV-2 sequencing data provides important support, while also raising a great challenge for the recognition of adaptive mutations. Here, we proposed a computational strategy to detect potentially adaptive mutations from their fixed and parallel patterns in the phylogenetic trajectory. We found that the biological meanings of fixed substitution and parallel mutation are highly complementary, and can reasonably be integrated as a fixed and parallel (paraFix) mutation, to identify potentially adaptive mutations. Tracking the dynamic evolution of SARS-CoV-2, 37 sites in spike protein were identified as having experienced paraFix mutations. Interestingly, 70% (26/37) of them have already been experimentally confirmed as adaptive mutations. Moreover, most of the mutations could be inferred as paraFix mutations one month earlier than when they became regionally dominant. Overall, we believe that the concept of paraFix mutations will help researchers to identify potentially adaptive mutations quickly and accurately, which will provide invaluable clues for disease control and prevention.

Subject(s)

COVID-19 , SARS-CoV-2 , Humans , Mutation , Pandemics , Phylogeny , SARS-CoV-2/genetics

6.

Detecting Potentially Adaptive Mutations from the Parallel and Fixed Patterns in SARS-CoV-2 Evolution

Ji, Cheng-Yang, Han, Na, Cheng, Ye-Xiao, Shang, Jingzhe, Weng, Shenghui, Yang, Rong, Zhou, Hang-Yu, Wu, Aiping.

Viruses ; 14(5):1087, 2022.

Article in English | MDPI | ID: covidwho-1857303

ABSTRACT

Early identification of adaptive mutations could provide timely help for the control and prevention of the COVID-19 pandemic. The fast accumulation of SARS-CoV-2 sequencing data provides important support, while also raising a great challenge for the recognition of adaptive mutations. Here, we proposed a computational strategy to detect potentially adaptive mutations from their fixed and parallel patterns in the phylogenetic trajectory. We found that the biological meanings of fixed substitution and parallel mutation are highly complementary, and can reasonably be integrated as a fixed and parallel (paraFix) mutation, to identify potentially adaptive mutations. Tracking the dynamic evolution of SARS-CoV-2, 37 sites in spike protein were identified as having experienced paraFix mutations. Interestingly, 70% (26/37) of them have already been experimentally confirmed as adaptive mutations. Moreover, most of the mutations could be inferred as paraFix mutations one month earlier than when they became regionally dominant. Overall, we believe that the concept of paraFix mutations will help researchers to identify potentially adaptive mutations quickly and accurately, which will provide invaluable clues for disease control and prevention.

7.

Genetic differentiation and diversity of SARS-CoV-2 Omicron variant in its early outbreak.

Weng, Shenghui; Shang, Jingzhe; Cheng, Yexiao; Zhou, Hangyu; Ji, Chengyang; Yang, Rong; Wu, Aiping.

Biosaf Health ; 4(3): 171-178, 2022 Jun.

Article in English | MEDLINE | ID: covidwho-1803615

ABSTRACT

The recently emerged Omicron variant of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has quickly spread around the world. Although many consensus mutations of the Omicron variant have been recognized, little is known about its genetic variation during its transmission in the population. Here, we comprehensively analyzed the genetic differentiation and diversity of the Omicron variant during its early outbreak. We found that Omicron achieved more structural variations, especially deletions, on the SARS-CoV-2 genome than the other four variants of concern (Alpha, Beta, Gamma, and Delta) in the same timescale. In addition, the Omicron variant acquired, except for 50 consensus mutations, seven great new non-synonymous nucleotide substitutions during its spread. Three of them are on the S protein, including S_A701V, S_L1081V, and S_R346K, which belong to the receptor-binding domain (RBD). The Omicron BA.1 branch could be divided into five divergent groups spreading across different countries and regions based on these seven novel mutations. Furthermore, we found that the Omicron variant possesses more mutations related to a faster transmission rate than the other SARS-CoV-2 variants by assessing the relationship between the genetic diversity and transmission rate. The findings indicated that more attention should be paid to the significant genetic differentiation and diversity of the Omicron variant for better disease prevention and control.

8.

Conserved Pattern and Potential Role of Recurrent Deletions in SARS-CoV-2 Evolution.

Weng, Shenghui; Zhou, Hangyu; Ji, Chengyang; Li, Liang; Han, Na; Yang, Rong; Shang, Jingzhe; Wu, Aiping.

Microbiol Spectr ; 10(2): e0219121, 2022 04 27.

Article in English | MEDLINE | ID: covidwho-1731263

ABSTRACT

SARS-CoV-2 continues adapting to human hosts during the current worldwide pandemic since 2019. This virus evolves through multiple means, such as single nucleotide mutations and structural variations, which has brought great difficulty to disease prevention and control of COVID-19. Structural variation, including multiple nucleotide changes like insertions and deletions, has a greater impact relative to single nucleotide mutation on both genome structures and protein functions. In this study, we found that deletion occurred frequently in not only SARS-CoV-2 but also in other SARS-related coronaviruses. These deletions showed obvious location bias and formed 45 recurrent deletion regions in the viral genome. Some of these deletions showed proliferation advantages, including four high-frequency deletions (nsp6 Δ106-109, S Δ69-70, S Δ144, and Δ28271) that were detected in around 50% of SARS-CoV-2 genomes and other 19 median-frequency deletions. In addition, the association between deletions and the WHO reported variants of concern (VOC) and variants of interest (VOI) of SARS-CoV-2 indicated that these variants had a unique combination of deletion patterns. In the spike (S) protein, the deletions in SARS-CoV-2 were mainly in the N-terminal domain. Some deletions, such as S Δ144/145 and S Δ243-244, have been confirmed to block the binding sites of neutralizing antibodies. Overall, this study revealed a conservative regional pattern and the potential effect of some deletions in SARS-CoV-2 over the whole genome, providing important evidence for potential epidemic control and vaccine development. IMPORTANCE Mutations in SARS-CoV-2 were studied extensively, while only the structure variations on the spike protein were discussed well in previous studies. To study the role of structural variations in virus evolution, we described the distribution of structure variations on the whole genome. Conserved patterns were found of deletions among SARS-CoV-2, SARS-CoV-2-like, and SARS-CoV-like viruses. There were 45 recurrent deletion regions (RDRs) in SARS-CoV-2 generated through the integration of deleted positions. In these regions, four high-frequency deletions parallelly appeared in multiple strains. Furthermore, in the spike protein, the deletions in SARS-CoV-2 were mainly in the N-terminal domain, blocking the binding sites of some neutralizing antibodies, while the structural variations in SARS-related coronavirus were mainly in the N-terminal domain and receptor binding domain. The receptor binding domain is highly related to hosting recognition. The deletions in the receptor binding domain may play a role in host adaption.

Subject(s)

COVID-19 , SARS-CoV-2 , Antibodies, Neutralizing , COVID-19/epidemiology , Humans , Mutation , Nucleotides , SARS-CoV-2/genetics , Spike Glycoprotein, Coronavirus

9.

Convergent evolution of SARS-CoV-2 in human and animals.

Zhou, Hang-Yu; Ji, Cheng-Yang; Fan, Hang; Han, Na; Li, Xiao-Feng; Wu, Aiping; Qin, Cheng-Feng.

Protein Cell ; 12(11): 832-835, 2021 11.

Article in English | MEDLINE | ID: covidwho-1384667

Subject(s)

COVID-19/pathology , Evolution, Molecular , SARS-CoV-2/genetics , Animals , COVID-19/virology , Disease Models, Animal , Humans , Polymorphism, Single Nucleotide , SARS-CoV-2/isolation & purification , SARS-CoV-2/metabolism , Spike Glycoprotein, Coronavirus/genetics

10.

Compositional diversity and evolutionary pattern of coronavirus accessory proteins.

Shang, Jingzhe; Han, Na; Chen, Ziyi; Peng, Yousong; Li, Liang; Zhou, Hangyu; Ji, Chengyang; Meng, Jing; Jiang, Taijiao; Wu, Aiping.

Brief Bioinform ; 22(2): 1267-1278, 2021 03 22.

Article in English | MEDLINE | ID: covidwho-1343631

ABSTRACT

Accessory proteins play important roles in the interaction between coronaviruses and their hosts. Accordingly, a comprehensive study of the compositional diversity and evolutionary patterns of accessory proteins is critical to understanding the host adaptation and epidemic variation of coronaviruses. Here, we developed a standardized genome annotation tool for coronavirus (CoroAnnoter) by combining open reading frame prediction, transcription regulatory sequence recognition and homologous alignment. Using CoroAnnoter, we annotated 39 representative coronavirus strains to form a compositional profile for all of the accessary proteins. Large variations were observed in the number of accessory proteins of 1-10 for different coronaviruses, with SARS-CoV-2 and SARS-CoV having the most (9 and 10, respectively). The variation between SARS-CoV and SARS-CoV-2 accessory proteins could be traced back to related coronaviruses in other hosts. The genomic distribution of accessory proteins had significant intra-genus conservation and inter-genus diversity and could be grouped into 1, 4, 2 and 1 types for alpha-, beta-, gamma-, and delta-coronaviruses, respectively. Evolutionary analysis suggested that accessory proteins are more conservative locating before the N-terminal of proteins E and M (E-M), while they are more diverse after these proteins. Furthermore, comparison of virus-host interaction networks of SARS-CoV-2 and SARS-CoV accessory proteins showed that they share multiple antiviral signaling pathways, those involved in the apoptotic process, viral life cycle and response to oxidative stress. In summary, our study provides a tool for coronavirus genome annotation and builds a comprehensive profile for coronavirus accessory proteins covering their composition, classification, evolutionary pattern and host interaction.

Subject(s)

Biological Evolution , COVID-19/virology , SARS-CoV-2/metabolism , Viral Proteins/genetics , Viral Proteins/metabolism , Genes, Viral , Humans , Molecular Sequence Annotation , Open Reading Frames , Protein Interaction Maps , SARS-CoV-2/genetics

11.

One year of SARS-CoV-2 evolution.

Wu, Aiping; Wang, Lulan; Zhou, Hang-Yu; Ji, Cheng-Yang; Xia, Shang Zhou; Cao, Yang; Meng, Jing; Ding, Xiao; Gold, Sarah; Jiang, Taijiao; Cheng, Genhong.

Cell Host Microbe ; 29(4): 503-507, 2021 04 14.

Article in English | MEDLINE | ID: covidwho-1309185

ABSTRACT

Since the outbreak of SARS-CoV-2, the etiologic agent of the COVID-19 pandemic, the viral genome has acquired numerous mutations with the potential to increase transmission. One year after its emergence, we now further analyze emergent SARS-CoV-2 genome sequences in an effort to understand the evolution of this virus.

Subject(s)

COVID-19/virology , Evolution, Molecular , Genome, Viral , Mutation , SARS-CoV-2/genetics , COVID-19/immunology , Humans

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL